A Study on Combined Effects of Reverberation and Increased Vocal Effort on ASR

Authors

  • Hynek Bořil
  • Seyed Omid Sadjadi
Abstract

This study analyzes the individual and combined effects of room reverberation and increased vocal effort on automatic speech recognition. The robustness of several state-of-the-art front-end feature extraction strategies and normalizations to these sources of speech signal variability is evaluated in the context of large and small vocabulary recognition tasks on American English and Czech speech corpora. For the large vocabulary task, speech material from the UT-Scope database comprising American English utterances is used. The Czech speech samples are drawn from the CLSD'05 data corpus and used for the small vocabulary tasks. Both databases contain neutral as well as increased vocal effort recordings. Simulated reverberant test conditions are generated using measured room impulse responses from the AIR database and utilized in the evaluations. It is shown that the robustness of a common MFCC front-end to reverberation and increased vocal effort can be considerably improved when paired with cepstral gain normalization and modified RASTA filtering. A combination of recently proposed mean Hilbert envelope coefficients and modified RASTA is found to provide balanced performance across all reverberation and vocal effort conditions.
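As a rough illustration of how simulated reverberant test conditions can be produced from clean recordings, the sketch below convolves a dry signal with a room impulse response (RIR). It is a minimal sketch under stated assumptions, not the authors' pipeline: the function names `synthetic_rir` and `reverberate` are hypothetical, the RIR is exponentially decaying noise standing in for a measured response such as those in the AIR database, and the peak-level matching step is an illustrative choice.

```python
import numpy as np


def synthetic_rir(rt60_s, fs, seed=0):
    """Hypothetical stand-in for a measured RIR: exponentially decaying noise.

    A measured response (e.g. from the AIR database) would be loaded from
    file instead; this toy model only mimics the decay shape.
    """
    rng = np.random.default_rng(seed)
    n = int(rt60_s * fs)
    t = np.arange(n) / fs
    # Amplitude envelope that falls by 60 dB over rt60_s seconds.
    envelope = 10.0 ** (-3.0 * t / rt60_s)
    rir = rng.standard_normal(n) * envelope
    rir[0] = 1.0  # direct-path component
    return rir / np.max(np.abs(rir))


def reverberate(speech, rir):
    """Convolve dry speech with an RIR and keep the original length.

    The output is rescaled so its peak matches the dry signal's peak
    (an assumption made here to keep levels comparable).
    """
    wet = np.convolve(speech, rir)[: len(speech)]
    peak = np.max(np.abs(wet))
    if peak > 0:
        wet = wet * (np.max(np.abs(speech)) / peak)
    return wet


fs = 16000                      # sampling rate in Hz (assumed)
dry = np.zeros(fs)              # one second of silence...
dry[100] = 1.0                  # ...with a unit impulse as toy "speech"
rir = synthetic_rir(0.5, fs)    # 0.5 s decay time, hypothetical value
wet = reverberate(dry, rir)
```

With real data, `dry` would be an utterance from the test set and `rir` a measured response resampled to the same rate; the reverberant signal is then passed to the front-end under evaluation.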




Publication date: 2012